T ♦ 

MP RIMIKGGVWRNTEDEILKAAVMKYGKNQWSRIASLLHRKSAKQCKARW 50 

♦ T 

YEWLDPSIKKTEWSREEEEKLLHLAKLMPTQWRTIAPIIGRTAAQCLEHY 100 
~ "~ ▼ 

EFLLDKAAQRDNEEETTDDPRKLKPGEIDPNPETKPARPDPIDMDEDELE 150 

MLSEARARLANTOG KKAKRKAREK QLEEARRLAAL OKRRELR AAGIEIQK 200 

▼ 

KRKRKRGVDYNAEIPFEKKPALGFYDTSEENYQALDADFRKLRQQDLDGE 250 

▼♦ ■ ♦ ■ 

LRSEKEGRD RKKDKQHLKRKK ESDLPSAILQTSGVSEFTKKRSKLVLPAP 300 

▼ T 

QISDAELQEVVKVGQASEIARQTAEESGITNSASSTLLSEYNVTNNSVAL 350 

♦ • ▼ 

RTPRTPASQDRI LQEAQNLMALTNVDTPLKGG ^IPi^^DFSGVTPQR^ 400 

• ▼ ▼ • 

WgTWVLSTPFRTI^ 450 

500 



ELEEREIDDTYIEDAADVDARKQAIRDAERVKEMKRMHKAVQKDLPRPSE 550 

VNETILRPLNVEPPLTDLQKSEELIKKEMITMLHYDLLHHPYEPSGNKKG 600 

T T ▼ 

KTVGFGTNNSEHITYLEHNPYEKFSKEELKKAQDVLVQEMEVVKQGMSHG 650 

ELSSEAYNQVWEECYSQVLYLPGQSRYTRANLASKKDRIESLEKRLEINR 700 

GHMTTEAKRAAKMEKKMKILLGGYQSRAMGLMKQLNDLWDQIEQAHLELR 750 

T ♦ 

TFEELKKHEDSAIPRRLECLKEDVQRQQEREKELQHRYADLLLEKETLKS 800 

KF* 

Fig. 1A 
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5' uncransiaced region: 



CGCACGACAGGAACIC 
CaAOACKCCTGCCSCCAMS 

Coding region: 

1 DMA BXKSZKQ DOIULXM ( ) 



:CC^TCqCTCTTACTACl l lVIVlXjAACCTCCTCTCO G L'lU. , l'l>GC 



1 




Ctataatcaa 


oaooaocata 


.tggaggaata 


ccaaooataa 


aattctaaaa 


51 


qcagcggtaa 


eaaaaeatag 


aaaaaaecag 


taatccaaaa 


ttaeeteatt 


actacataoa 


121 


aaaccaocaa 


aacaotocaa 


aoccaoatoo 


tacoaataoc 


tooatccaao 


cattaagaag 


181 


acaoaatoat 


ccaoaoaaaa 


aoaaoaaaaa 


.ctcttgcact 


-ttaracaaatt 


aatoccaact 


241 


cagsggagga 


ccattactcc 


aatcattooa 


aoaacaocoa 


cccaotactt 


aaaacaccat 



301 qaatttctss 
3 61 cgaaaactta 
421 ccaactgata 



481 
541 
601 
661 
721 
781 
841 
901 
961 
1021 
1081 

1141 
1201 
1261 
1321 
1381 

1441 
1501 
1561 
1621 
1681 
1741 
1801 
1861 
1921 
1981 
2041 
2101 
2161 
2221 
2281 
2341 
2401 



aatactcagg 

cgtcttgctg. 

aaaagaaaaa . 

gcccttggtt. 

aaattaagac. 

aaaaaagaca. 

caaactagtg 

cagatttcag 

cgtcaaactg 

cacaatgtca 

agaattctgc 

ggtggactta 
gttgtacaga 



iaaaLaaaflC-tgcccaaaga gacaatgaag 
aacctggaga aatagatcca aatccagaaa 
tggatgagga tgaacttgag atgctttctg 

2 MOCLSJUl LOCAiXXATXOB DOHUI < 

gaaagaaggc . . caagaggaaa . gc aagagaga 
ccctccaaaa . aagaagagaa cttcgagcaa 
.gsaagagagg..agct:gat:tat aa 
.ttcatgatac ttccgaggaa. 
a*caggatct . tgatggggag..ctaagatctg 
aacagcattt ..gaaaaggaaa..aaagaatctg 
gtgtttctga atttactaaa aagagaagca 
atgcagaact ccaggaagtt gtaaaagtag 
ccgaggaacc tggcataaca aattctgctt 
ccaacaacag cgttgctctt agaacaccac 
aggaagccca gaacctcatg gccctcacca 

3 ACTXV&KM WmHB ( ) 

at accccatt acataaaaat'aacttctcaa 
! ctccaaacac^^ttctctc^_actcca^tca 



aggaaacaac 
caaaaccagc 
aagccagagc 

i 

aacaattgga. 

ctggcataga. 

.Ccccatttga. 

.ctVttgacgc. 

.aaaaagaagg. 

atttaccatc 

aactagtact 

gccaagcgag 

ccagtacact 

gaacaccagc 

atgtggacac 



agatgaccca 
gcggcctgat 
ccgcttggct 

agaagcaaga 
aattcagaag 
aaaaaagccc 
agatttcagg 
aagagataga 
agctattctt 
tcctgcccct 
tgaaattgca 
tttgcctgag 
ttcccaggac 
cccattgaaa 



qtgtaactcc acagcgacaa 
qgactccttc taatggagct 



gaaggqctqa ctceccqqaq tqqaacaact cccaaaccaq ttattaactc tactccqqqt 
aqaactcctc ttcqaqacaa qttaaacatt aatcccqaqq atqqaatqqc agactatagt 
gatceetctt acgtgaagca qatqqaaaqa gaatcecgag aacatctccg tttagggttg 



ttgggccttc 

gagctggaag 

cgaaagcagg 

gtccagaaag 

gtagaaccgc 

acaatgcttc 

aaaaccgtag 

tatgaaaagt 

gaagtggtta 

tgggaagaat 

aatctggcta 

ggtcacatga 

cttgggggtC 

caaattgaac 

tctgctattc 

gaaaaggaac 

aaattctga 



ctgcccctaa gaatgatttt gaaattgttc taccagaaaa tgccgagaag 
aacgtgaaat agatgatact tacattgaag atgctgctga tgtggatgct 
ccatacgaga tgcagagegt gtaaaggaaa tgaaacgaat gcataaagct 
atctgccaag accaccagaa gtaaatgaaa ctaetctaag acccttaaac 
ctttaacaga tttacagaaa agtgaagaac taatcaaaaa agaaatgatc 
attatgacct tctacatcac ccttatgaac catctggaaa caaaaaaggc 
ggtttggtac caataattca gagcacatta cctatctgga acataaecct 
tctccaaaga agagctgaaa aaggcccagg atgttttggt gcaggagatg 
aacaaggaat gagccatgga gagctctcaa gtgaagctta caaccaggtg 
gctacagtca agttttacat cttcctgggc agagccgcta cacacgggcc 
gtaaaaagga cagaaetgaa tcacttgaaa agaggctcga gataaacagg 
cgacagaagc caagagggct gcaaagatgg aaaagaagat gaaaattttg 
accagtctcg tgctatgggg ctcatgaaac agttgaatga cttatgggac 
aggctcactt ggagttacgc acttttgaag aactcaagaa acatgaagae 
cccggaggct agagtgtcta aaagaagacg ttcagcgaca acaagaaaga 
ttcaacatag atatgctgat ttgctgctgg agaaagagac tttaaagtca 



untranslated region: 

GGACGGflCATTrrAGMTTTXAAOTATTX AOO^TCATTCTmACTAAIIGTCATM^^ 
TTTAAAAAGCCTAATTTTAAAGTGCTOCCTQTSAGTAACTCTTGAATAAAAACAAAW 
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Consensus GATTTAACATAA (SEQ ID NO: 13) 
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